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Reaulation of site-specific recoiabination by site-specific 
recoiubinase/nuclear receptor fusion proteins 

Specification 

The present invention relates to the technical field of genetic 
manipulation, and more specifically, to the use of recombina- 
tion-mediated DNA rearrangements and to the use of the regula- 
tory potential of nuclear receptors. 

The use of si tf--specif ic recombinases (SSRs) to induce defined 
rearrangements of DNA has been described in a variety of organ- 
isms (1-12) . These reports describe the introduction of a DNA 
construct that contains SSR target sites. Subsequent exposure 
to the SSR enzyme activity resulted in the DNA rearrangement 
determined by the disposition of the target sites (see refe- 
rence 13 for a recent review of SSRs) . Three SSRs have been 
used in this manner to date; FLP recombinase from the 2m epi- 
some of Saccharomyces cerevisiae (1,2,5,6,9,10), CRE recombi- 
nase from the Escherichia coli phage PI (3,4,8,11,12) and R 
recombinase from pSRl of Zygosaccharomyces rouxii (7) . Amongst 
other SSR systems relevant to the invention described here are 
those listed in references 13 and 14, and SSRs from Kluyveromy- 
ces drosophilarium (15), Kluyveromyces waltii (16), X Int (17) 
and the Gin recombination system from phage Mu (18) . The con- 
tent of the above document is incorporated by reference. 

For many applications in cells, organisms and cell -free in 
vitro systems, SSR induced DNA rearrangements must be regula- 
ted. Current implementation of the potential offered by SSRs is 
limited by the means available to regulate SSR activity. In 
experiments with cultured cells, unregulated SSRs have been 
used. For example, after introduction of SSR target sites into 



ded recombination event was regulated mereiy ny th- -ime oL 
introduction of an appropriate macromolecule . Amongst other 
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ranged target sites and the SSR and in which the recombination 
event can be induced after cell numbers have been expanded. 

In experiments with transgenic animals, the issue of SSR regu- 
lation has been addressed by regulating the expression of an 
introduced SSR gene using the inducible heat -shock promoter in 
Drosophila (5) or a tissue-specific promoter in mice (11) . Both 
approaches have limited applicability. Namely, heat-shock regu- 
lation of transgene expression is currently only useful in 
flies and no suitable counterpart is available for use in cell 
lines or vertebrates. Also, the use of a tissue-specific promo- 
ter to regulate transgene expression relies on the limited 
availability of suitable promoters and enhancers and the ex- 
pression pattern achieved is confined to the times and places 
at which these tissue specific elements are active. 

The problem underlying the present invention was to provide a 
regulated recombination system, wherein the disadvantages of 
the prior art are at least partially eliminated. More specifi- 
cally, the problem was to provide a recombination system, 
wherein the recombination event can be induced independently 
from any tissue specific restrictions. 

This patent application describes an invention that regulates 
SSR activity, rather than its expression. Thus any means of 
achieving and directing expression can be used, such as using 
ubiquitous or broadly active promoters and enhancers, as well 
as tissue specific or inducible promoters and enhancers. 

One aspect of the present invention relates to a fusion pro- 
tein, comprising a recombinase protein or a component of a re - 
combinase complex, fused to part or all of a nuclear receptor, 

scnce ot ^igana i^iriamu i.'i.> sa.. ^ j.igari.i o^iiaiiiu aoir.cii:. caiiu 
(b) recombinase activity is induced or altered by binding of 
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Preferably the recombinase activity is altered by binding of 
ligand to ligand binding domain by a factor of at least 10, 
more preferably of at least 20 and most preferably of at least 
40 . 

The present invention resides in the regulation of SSR activity 
by fusing an SSR protein to the ligand binding domain (LED) of 
a nuclear receptor. SSR-LBD fusion proteins comprise an amino 
acid sequence of SSRs physically attached to the amino acid 
sequence of an LED of a nuclear receptor. That is, the inven- 
tion is a fusion protein, comprising a recombinase protein, or 
a component of a recombinase complex, fused to the ligand bin- 
ding domain of a nuclear receptor. Preferably, the recombinase 
protein or a component of a recombinase complex is fused to the 
nuclear receptor or ligand binding domain thereof by means of 
genetic fusion, i.e. the SSR-LBD protein is a linear genetic 
fusion encoded by a single nucleic acid. On the other hand, the 
present invention also encompasses SSR-LBD fusion proteins 
which are linked by different means, e.g. through a spacer mo- 
lecule having reactive groups thereon, which are covalently 
bound to each protein domain. 

Most simply, the attachment of the SSR and LED components can 
be achieved by making a DNA construct that encodes for the ami- 
no acid sequence of the SSR-LBD fusion protein with the LBD 
encoding DNA placed in the same reading frame as the SSR enco- 
ding DNA, preferably either at the amino or carboxy termini of 
the SSR protein (19) . More preferably, the nuclear receptor or 
ligand binding domain thereof is fused to the C- terminus of the 
recombinase protein or component of a recombinase complex. In 
an especially preferred embodiment of the present invention the 
nuclear receptor or ligand binding domain thereof is fused to 



SSR-LBD fusion proteins can coexist with target sites without 
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ments, recombinase activity of the SSR-LBD fusion protein, in 
the absence of the relevant ligand, is at least 200x less ac- 
tive than wild type recombinase activity. Upon presenting the 
SSR-LBD fusion proteins with the relevant ligand, recombinase 
activity is induced to more than 20% of wild type, that is, 
equal to or greater than 4 Ox induction. This is the first 
description of post-transcriptional regulation of SSRs by any 
means. In particular, the invention permits the propagation of 
both the unrearranged target sites and the SSR-LBD in living 
systems. More specifically: 

(a) cell lines can be established that carry both the unrear- 
ranged target sites and the SSR-LBD as a prerequisite for bio- 
chemical studies or introduction into organisms. The cell lines 
can be homogeneous and characterised before expansion to the 
required quantities and subsequent induction of recombination 
by administration of the relevant ligand, 

(b) recombination can be regulated in any experimentally mani- 
pulatable multicellular organism by administration of the rele- 
vant ligand. The SSR-LBD could be introduced to the organism 
either through the incorporation of cells or by direct means 
such as microinjection or in the genome of a viral based vec- 
tor. Recombination can be induced after characterisation by 
administration of the relevant ligand. 

The term "nuclear receptor", as used herein, refers to a mole- 
cule, preferably a protein molecule, which may be glycosylated 
or unglycosylated, having the abilities to bind ligand and to 
be incorporated into a nucleus of a cell. Specifically, the 
term nuclear receptor refers to those proteins that display 
functional or biochemical properties that are similar to the 
functional or biochemical properties displayed by the steroid 
hormone receptors with respect to ligand binding, for example, 

' ' ' ' ' .- • ■ 

prot,eins thai, are related by tneir" ammo acid sequence to the 
LBDs of the steroid hormone receptors. The paper of Laudet et 
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ment of nuclear receptor amino acid sequences using standard 
methodologies. Included in the definition of nuclear receptors 
used here are also those proteins not listeci in Laudet et al . 
but which can be included using the methods employed by Laudet 
et al.. The term nuclear receptor also includes mutant deriva- 
tives of nuclear receptor amino acid sequences which retain 
sufficient relatedness to nuclear receptor amino acid sequences 
as to be identifiable as related using the methods employed by 
Laudet et al . 

The nuclear receptor which is fused to the recombinase protein 
is preferably a hormone receptor, e.g. a receptor recognized by 
steroids, vitamins or related ligands. Examples of suitable 
nuclear receptors are listed in reference 20, which is hereby 
incorporated by reference. Preferably, the nuclear receptor is 
a steroid hormone receptor, more preferably, a glucocorticoid, 
estrogen, progesteron, or androgen receptor. 

In the SSR-LBD fusion protein of the present invention, it is 
not required that the complete nuclear receptor is present; 
i.e. it is sufficient that the amino acids that bind the ligand 
are fused to the SSR. 



Upon binding their relevant ligand, nuclear receptors become 
active, or altered, transcription factors. The cloning of cDNAs 
encoding members of the steroid receptor family of proteins 
revealed that they share amino acid sequence homology, in 
particular in the protein domain that binds ligand. The LED can 
be separated from the rest of the protein and fused to other 
transcription factors conferring ligand regulation onto the 
resulting fusion proteins. For the glucocorticoid and estrogen 
receptors, the domain that binds ligand has been fused to other 



experimentally legulated iii this manner so rar are also Lrans 
cription factors. Transcription factors and oncoproteins are 
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26, is essentially similar to the role that LBDs play in regu- 
lating steroid receptors. Prior attempts to extend the regula- 
tory possibilities of the LBD fusion proteiji strategy beyond 
transcription factors and oncoproteins have yet to be reported. 
Attempts to regulate the enzyme activities of either fi-galacto- 
sidase or dihydrof olate reductase by fusing them with the 
glucocorticoid LBD have been unsuccessful (D. Picard, public 
seminar given at EMBL, Heidelberg, Nov. 1991) . 

The present invention extends the regulatory possibilities of 
the LBD fusion protein strategy beyond the currently documented 
transference of ligand regulation from nuclear receptors to 
other transcription factors, to include an enzyme activity, 
namely a site specific recombinase. The terra "site specific 
recombinase" refers to any protein component of any recombinant 
system that mediates DNA rearrangements in a specific DNA 
locus, including site specific recombinases of the integrase or 
resolvase/invertase classes (13; Abremski, K.E. and Hoess, R.H. 
(1992) Protein Engineering 5, 87-91; Khan, E., Mack, J.P.G., 
Katz, R.A., Kulkosky, J. and Skalka, A.M. (1991) Nucleic acids 
Res. 19, 851-860) and site-specific recombination mediated by 
intron- encoded endonucleases (Perrin, A., Buckle, M. and Dujon, 
B. (1993) EMBO J. 12, 2939-2947). Preferred recombinase pro- 
teins, which can be used in the fusion proteins according to 
the invention, are selected from the group consisting of: FLP 
recombinase, Cre recombinase, R recombinase from the Zygo- 
saccharomyces rouxii plasmid pSRl, A recombinase from the Kluy- 
veromyces drosophilarium plasmid pKDl, A recombinase from the 
Kluyveromyces waltii plasmid pKWl, any component of the X Int 
recombination system, any component of the Gin recombination 
system, or variants thereof. The term "variant" in this context 
refers to proteins which are derived from the above proteins by 



could retain tne ability to act as a recombinase, or it couia 
retain protein/protein or protein/DNA interactions critical to 
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In a preferred embodiment of the invention, FLP recombinase is 
fused to the LBD of the estrogen, glucocorticoid, progesterone 
or androgen receptors (20) . Other preferred embodiments include 
fusing Cre recombinase or R recombinase, or SSRs from Kluyver- 
omyces drosophilarium or Kluyveromyces waltii, to these LBDs. 
Another preferred embodiment involves regulating one or more 
components of an SSR complex to these LBDs, in particular, com- 
ponents of the X Int or Gin recombination systems. The inven- 
tion, in providing a means to regulate recombination, is how- 
ever not limited to known recombinases and recombination com- 
plexes and is not limited to known nuclear receptor LBDs. Rat- 
her, the strategy of fusing recombinases, or components of re- 
combination complexes, to LBDs of nuclear receptors is appli- 
cable to any fusion combination of these proteins which display 
the desired characteristics readily identifiable without undue 
experimentation on the part of a skilled person. 

A further subject-matter of the present invention is a nucleic 
acid which encodes the fusion protein according to the present 
invention. Preferably, the nucleic acid is a DNA or RNA. 

Still a further subject-matter of the present invention is a 
recombinant vector comprising at least one copy of the nucleic 
acid as defined above. This recombinant vector may be a euka- 
ryotic vector, a viral vector, or a prokaryotic vector, or a 
vector which can be maintained in eukaryotic and prokaryotic 
host cells. The recombinant vector is obtainable by inserting a 
nucleic acid encoding a SSR-LBD fusion protein into a suitable 
starting vector. Specific examples of suitable starting vectors 
are given for example in Molecular Cloning. A Laboratory Manu- 
al, 2nd edition, J. Sambrook et al . (1989), Cold Spring Harbor 
Laboratory Press, chapters 1, 2, 3, 4, 16 and 17. 



especially preferred. An example of a plasmid vector is the 
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gen receptor. The plasmid pHFEl has been deposited at the 
Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH 
(DSM) , Mascheroder Weg lb, D-38124 Braunschweig on June 25, 
1994 under the accession number DSM 9265 according to the 
requirements of the Budapest Treaty. 

Still a further subject-matter of the present invention is a 
cell containing a nucleic acid or a recombinant vector as defi- 
ned above. Preferably, the cell is stably transformed with the 
nucleic acid or the vector. Suitable cells are eukaryotic or* 
prokaryotic cells. Examples of prokaryotic cells are gram-nega- 
tive bacterial cells, especially E.coli cells. Examples of 
eukaryotic cells are mammalian cells, yeasts and trypanosomes . 

The invention separates the function of ligand binding from the 
functions of transcription factors, coupling ligand binding to 
recombinase activity. Thereby, the binding of ligand can be 
assessed by any means that measures recombinase activity. Thus, 
further embodiments of the invention include methods for deter- 
mining the binding of ligand to the LBD of a nuclear receptor, 
comprising the steps of: 

(i) the introduction of the SSR-LBD fusion protein or the nu- 
cleic acid coding therefor into cells, or appropriate cell -free 
systems, that contain the DNA target sites for the SSR; (ii) 
administering the ligand or a mixture suspected to contain a 
ligand or ligands to be evaluated; if the ligand is not already 
present in the system; (iii) detecting the recombinase activi- 
ty, if any, of the SSR-LBD by detecting, directly or indirect- 
ly, recombination or changes in the recombination rate between 
the DNA target sites. 

The introduction of SSR-LBD fusion proteins into cells may be 

i~ecessaiy m systems wriich already contain the ligand and in 
which the ligand-concentration is determined by detecting re- 
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Particularly preferred embodiments include: 

(a) direct measurements of the physical arrangement of the tar- 
get sites using techniques such as gel electrophoresis of DNA 
molecules, Southern blotting or polymerase chain reaction. 

(b) indirect measurements by assessing the properties encoded 
by the DNA regions carrying the target sites before or after 
recombination. For example, recombination could excise an anti- 
biotic resistance gene from the genome of the host and thus 
recombination can be measured as sensitivity to the antibiotic. 

(c) the measurement of intracellular liqand concentrations by 
evaluating recombination between the DNA target sites resident 
in the cells that also carry the SSR-LBD fusion proteins. 

(d) the evaluation of ligand binding to an LBD without the need 
to use radiolabelled ligand derivatives or without reliance on 
the transcription factor properties of the corresponding nu- 
clear receptor. 

(e) evaluating the effect of mutations in the LBD on ligand 
binding. Since SSR-LBD recombination requires ligand, mutations 
in the LBD that decrease ligand binding can be ascertained. 
Alternatively, mutations that improve binding of a different 
ligand can be selected. For example, ligand -dependent recombi- 
nation could rearrange the DNA, so that an antibiotic resis- 
tance gene is expressed. Cells with the rearranged DNA will 
grow under the appropriate antibiotic selection, cells with 
unrearranged DNA will not. Specifically this describes a method 
for determining the effect of mutations in the LBD of a nuclear 
receptor on its ability to bind ligand, comprising the steps of 
(a) introducing mutations into the LBD of the SSR-LBD fusion 
protein, and (b) following steps (i) to (iii) above. 

The invention also encompasses a method for regulating the re- 
combination of DNA target sites, comprising the steps of: 



cording ro the present invention, which contacting may be ac- 
complished by direct introduction of the protein or by trans- 

* — ■ -1 — * I .■ ■ ■■ . ■ " ' - \ • \ , . . , T • - , • T •■ • * . f ■ ' i' 
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(c) effecting a recombination of said DNA target sites by con- 
tacting the fusion protein with a ligand for the nuclear recep- 
tor component of the fusion protein, e.g. by adding the ligand 
to the system at a predetermined time, or by effecting produc- 
tion of the ligand in the system at a predetermined time. 

Description of the Figures 

Figure lA depicts the plasmid, pNEOSGAL (thin line) , integrated 
into the genome of 293 cells (thick line) . FLP recombinase tar- 
get sites (FRTs) are depicted as broad arrows and the neomycin 
resistance gene lies between the two FRTs. Upon estradiol admi- 
nistration, the DNA between the FRTs is excised, leaving one 
FRT in the genome and one on the excised circular DNA. The 
probe used for the Southern blot of Fig. 2 is also depicted. 

Figure IB depicts the steps involved in the creation of plasmid 
pHFEl . 

Figure 2 shows a Southern blot of a timecourse of recombination 
occurring in the cell line PI. 4. At the times indicated after 
estradiol, or ethanol only, administration in the upper part of 
the figure, cells were lysed, DNA purified, restricted with 
Ndel, run on a 1.25% agarose gel and blotted to Biodyne B mem- 
branes by standard methodologies. The membrane was hybridized 
with the radioactively labelled probe depicted in Fig. lA. The 
upper band shows the unrecombined integrated DNA and the lower 
the recombined band. Estradiol was dissolved in ethanol and the 
extreme right hand lane shows cells treated with the equivalent 
amount of ethanol, without estradiol, for 51 hours. 

Figure 3 shows the complete amino acid sequence of the SSR-LBD- 



amg domair; is located trorn ammo acid '\2'i-n^. 
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recombinase domain, the linker peptide and the estrogen binding 
domain, respectively, are specified. 

The following examples are offered by way of illustration and 
is not intended to limit the invention in any manner. 

Example 1 

Construction of plasmid containing a SSR-LBD- fusion gene 

The experiment involves two plasmids, one of which is the tar- 
get for recombination, and the other is the recombinase expres- 
sion plasmid. As the target plasmid, pNEOSGAL (9) was employed. 
pNEOfiGAL contains two FLP recombinase target sites (FRT) sur- 
rounding a gene encoding neomycin resistance (Fig. la) . The 
expression plasmid, pHFE 1, carries two genes, one encoding an 
FLP recombinase-estrogen receptor LBD fusion protein and the 
other encoding hygromycin resistance (Fig. lb). The plasmid was 
constructed by standard cloning procedures using pBluescribe 
(Stratagene) , pOG 44 (9), pHE63 (22) and pCNH2 (27). The stop 
codon of the FLP recombinase coding region present in pOG44 was 
mutated by oligonucleotide replacement to introduce BamHl, 
BsiWl and EcoRl sites and continue the open reading frame. The 
BamHI site in the estrogen receptor encoded by pHE63 was used 
to join the coding regions of FLP recombinase and estrogen 
receptor LBD. All of the estrogen receptor coding region car- 
boxy to the BamHI site is present in the plasmid, including its 
stop codon. 



Exainple 2 

Regulation of site specific recombination 

pNEOSGAL was introduced into 293 human embryonal kidney cells 
by electroporation (5 x 10^ cells in 500 /il phosphate-buf f ered 



stant cells were cloned ana character isca for incorporation ol 
target plasmid DNA by Southern blotting. A clone, PI, showing 
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the second plasmid, pHFE 1 (1 /xg of XmnI linearised pHFE 1 was 
precipitated by the standard calcium phosphate method onto 10 
PI cells which were cultured in DMEM without phenol red, 10% 
charcoal stripped fetal calf serum) . Four clones resistant to 
both neomycin and hygromycin (0.4 mg/ml G418, 0.4 mg/ml hygro- 
mycin B) were isolated and the dependance of recombination on 
estradiol administration was observed in all four. For clone 
PI. 4, a time course of recombination in the presence of 10"* M 
estradiol is shown (Fig. 2) . 
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Claims 

1. A fusion protein, comprising a recombinase protein or a 
component of a recombinase complex, fused to part or all of 
a nuclear receptor so that the amino acids that bind the 
ligand of said nuclear receptor are included, such that, in 
cells or appropriate cell-free systems: 

(a) recombinase activity is inhibited in the absence of 
ligand binding to said ligand binding domain, and 

(b) recombinase activity is induced or altered by binding of 
ligand to said ligand binding domain. 

2. The fusion protein of claim 1, 

wherein the nuclear receptor is a hormone receptor. 

3. The fusion protein of claim 1 or 2, 

wherein the nuclear receptor is a steroid hormone receptor. 

4. The fusion protein of any one of the claims 1-3, wherein the 
nuclear receptor is a mutated derivative of a nuclear 
receptor such that it retains the characteristics of the 
fusion protein of claim 1. 

5. The fusion protein of any of the claims 1-4, 

wherein the nuclear receptor is a vertebrate glucocorticoid, 
estrogen, progesteron or androgen receptor. 

6. The fusion protein of any one of the claims 1-5, 

wherein the recombinase protein or component of a recombi- 
nase complex is selected from the group consisting of: FLP 
recombinase, Cre recombinase, R recombinase from the Zygo- 
saccharomyces rouxii plasmid pSRl, A recombinase from the 
Kluyveromyces drosophilarium plasmid pKDl, A recombinase 



7. The fusion Drot.ein of any of claims 1-6, 
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wherein the recombinase protein or compound of a recombinase 
complex is fused to the nuclear receptor or ligand binding 
domain thereof by means of a genetic fusion. 

8. The fusion protein of any of claims 1-7, 

wherein said nuclear receptor or ligand binding domain the- 
reof is fused to the N- or C- terminus of said recombinase 
protein or component of a recombinase complex. 

9. The fusion protein of any of claims 1-8, 

wherein said nuclear receptor is fused to said recombinase 
protein or component of a recombinase complex through a 
peptide linker. 

10. The fusion protein of any of claims 1-9, 

which comprises the FLP recombinase and the ligand binding 
domain of the estrogen receptor. 

11. A nucleic acid which encodes the fusion protein of any one 
of the claims 1-10. 

12. The nucleic acid of claim 11, which is a DNA or RNA. 

13. A recombinant vector comprising at least one copy of the 
nucleic acid of claims 11 or 12. 

14. The vector of claim 13, which is a plasmid. 

15. The plasmid pHFEl (DSM 9265) 

16. A cell containing a nucleic acid of claims 11 or 12 or a 

recombinant vector of any one of the claims 13-15. 



ligand binding domain of a nuclear receptor, comprising the 
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(a) the introduction of the fusion protein of claims 1-9, or 
the nucleic acid of claims 10 or 11 into cells, or appropri- 
ate cell -free systems, that contain the DNA target sites for 
recombinat ion , 

(b) optionally administering the ligand or a mixture suspec- 
ted to contain a ligand or ligands, to be evaluated, 

(c) detecting the recombinase activity, if any, of said 
fusion protein by detecting recombination or changes in the 
recombination rate between said DNA target sites. 

19. The method of claim 18, 

wherein the recombination between said DNA target sites is 
detected by direct measurement of the physical arrangement 
of said target sites. 

20. The method of claim 18, 

wherein the recombination between said DNA target sites is 
detected by assessing the properties encoded by the DNA 
regions carrying the target sites before or after recombina- 
tion. 

21. A method for determining the effect of mutations in the 
ligand binding domain of a nuclear receptor on its ability 
to bind ligand, comprising the steps of: 

(a) introducing mutations into the ligand binding domain of 
the fusion protein of claims 1-10, 

(b) following steps (a) to (c) of claim 18. 

22. A method for regulating the recombination of DNA target 
sites, comprising the steps of: 

(a) providing cells or appropriate cell -free systems that 
contain DNA target sites for a site specific recombinase. 



tacting the tusior; protein wit.h a . i ganc nuceri: 
receptor component of said fusion protein. 
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FIG. IB 
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FIG. 2 
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Fig. 3 

FLP/EBD AMINO ACID SEaUENCE AS ENCODED BY PHFEl 
1 MPQFDILCKT PPKVLVRQFV ERFERPSGEK lALCAAELTY LCWMTTHNGT 
5 1 AIKRATFMSY NTHSNSLSL DIVNKSLQFK YKTQJCATILE ASLKKUPAW 
101 EFniPYYGClKHQ^DITDIV SSLQJjQpESS EEADKGNSHS KKMLKALLSE 
, , ^y,^ .rr^rjjr^ aj-^SFEYTSR FTKTKTLYQF LPLATFTNCG RFSDIKNVDP 



ID J. vjri:oiv¥xii 

201 KSFKLVQNKY LGVnaCLVT ETKTSVSRHI YFFSARGRID PLVYLDEFLR 
25 1 NSEPVUKRVN RTGNSSSNKaEYQUKDNLV RSYNKALKKN APYSIFAIKN 
301 GPKSHIGRHL MTSFLSMKGL TELTNWGNW SDKRASAVAR TTYTHQJTAI 
35 1 PDHYFALVSR YYAYDPISKE MIALKDETNP lEEWQfnEQL KGSAEGSIRY 
401 PAWNGIISQ.E VLDYLSSYIN RRI 

- FLP ENDS HERE - 
424 SVRGS 

- LINKER PEPTIDE ENDS HERE - 

429 MK GGIRKDRRGG RMLKHKRQRD 

45 1 DGEGRGEVGS AGDMRAANLW PSPIMIKRSK KNSLALSLTA DQMVSALLDA 
501 EPPHYSEYD PTRPFSEASM MGULTNLADR ELVHMINWAK RVPGFVDLTL 
551 HDQVHLLECA WLEILMIGLV WRSMEHPVKL LFAPNLLLDR NQGKCVEGMV 
601 HFDMLLATS SRFRMMNLQG EEFVCLKSII LLNSGVYTFL SSTLKSLEEK 
65 1 DHIHRVLDKI TDTLIHLMAK AGLTLQQQHQ. RLAQLIilLS HIRHMSNKGM 



- ESTROGEN BINDING DOMAIN ENDS HERE - 
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Fig. 4/1 

CODING SEQJJENCE OF THE FLP/EBD FUSION PROTEIN ENCODED BY 
PHFEl 

ATGCCACAATXTGATATATIATGTAAAACACCACCTAAGGTGCTTGITCGT 

CAGTITGTGGAAAGGTrTGAAAGACCTTCAGGTGAGAAAATAGCATTATG 

TCCTGCTGAACTAACCTATTTATGTTGGATGATrACACATAACGGAACAGC 

AATCAAGAGAGCCACATrCATGAGCTATAATACTATCATAAGCAATTCGC 

TGAGTrCCGATATTGTCAACAAGTCACTGCAGTTTAAATACAAGACGCAA 

AAAGCAACAATTCTGGAAGCCTCATTAAAGAAATTGATTCCTGCTTGGGA 

ATTTAC^ArrArrCClTACTATGGACAAAAACATC^^ 

ATATTGTAAGTAGnTGCAATTACAGTTCGAATCATCGGAAGAAGCAGAT 

AAGGGAAATAGCCACAGTAAAAAAATGCTTAAAGCACTTCTAAGTGAGG 

GTGAAAGCATCTGGGAGATCACTGAGAAAATACTAAATTCGTTTGAGTAT 

ACTTCGAGATTTACAAAAACAAAAACTTTATACCAATTCCTCTTCCTAGC 

TACnTCATCAATTGTGGAAGATTCAGCGATATTAAGAACGTTGATCCGA 

AATCAnTAAATTAGTCCAAAATAAGTATCTGGGAGTAATAATCCAGTG 

TTrAGTGACAGAGACAAAGACAAGCGTTAGTAGGCACATATACTTCTTTA 

GCGCAAGGGGTAGGATCGATCCACTTGTATATTTGGATGAATTTTTGAGGA 

ATTCTGAACCAGTCCTAAAACGAGTAAATAGGACCGGCAATTCTTCAAGC 

AACAAGCAGGAATACCAATTATTAAAAGATAACTTAGTCAGATCGTACA 

ACAAAGCTITGAAGAAAAATGCGCCTTATTCAATCnTGCTATAAAAAA 

TGGCCCAAAATCTCACATTGGAAGACATTTGATGACCTCATTTCTTTCAAT 

GAAGGGCCTAACGGAGTTGACTAATGTTGTGGGAAATTGGAGCGATAAGCG 

TGCTTCTGCCGTGGCCAGGACAACGTATACTCATCAGATAACAGCAATACCT 

GATCACTACTrCGCACTAGTTTCTCGGTACTATGCATATGATCCAATATCA 

AAGGAAATGATAGCATTGAAGGATGAGACTAATCCAATTGAGGAGTGGC 

AGCATATAGAACAGCTAAAGGGTAGTGCTGAAGGAAGCATACGATACCCC 

GCATGGAATGGGATAATATCACAGGAGGTACTAGACTACCTTTCATCCTAC 

ATAAATAGACGCATA 

-FLP ENDS HERE- 
TCCGTACGCGGATCC 

- SYNTHETIC UNKER SEaUENCE ENDS HERE - 

ATGAAAGGTGGGATACGAAAAGACCGAAGAGGAGGGAGAATGTTGAAAC 
ACAAGCGCCAGAGAGATGATGGGGAGGGCAGGGGTGAAGTGGGGTCTGCTG 



j^., ^•^^ f^^- 



1 i C 'Xu 1 Lt.'XAUl. i"l >^'oA i ^-A 1 o^L.^ - i A.. . v). . A.M. . . ■ ■ 

TGGnXACATGATCAACTGGGCG.^GAGGGTGCCAGGCTrrGTGGA-ITTGAC 



SUBSriflHE SHEET (RULE 261 



wo 95/00555 



6/6 



PCT/EP94/02088 



Fig. 4/2 



CCTCCATGATCAGGTCCACCTTCTAGAATGTGCCTGGCTAGAGATCCTGATG 

ATTGGTCTCGTCTGGCGCTCCATGGAGCACCCAGTGAAGCTACTGTTTGCTCCT 

AACTTGCTCTTGGACAGGAACCAGGGAAAATGTGTAGAGGGCATGGTGGAG 

ATCTTCGACATGCTGCTGGCTACATCATCTCGGTTCCGCATGATGAATCTGCA 

GGGAGAGGAGTTTGTGTGCCTCAAATCTATTATTTTGCTTAATTCTGGAGTG 

TACACATTTCTGTCCAGCACCCTGAAGTCTCTGGAAGAGAAGGACCATATCC 

ACCGAGTCCTGGACAAGATCACAGACACTTTGATCCACCTGATGGCCAAGGC 

AaGCrTGArr.CTGCAGCAGCAGCACCAGCGGCTGGCCCAGCTCCTCCTCATCCT 

cfcCCACATCAGGCACATGAGTAACAAAGGCATGGAGCATCTGTACAGCAT 

GAAGTGCAAGAACGTGGTGCCCCTCTATGACCTGCTGCTGGAGATGCTGGAC 

GCCCACCGCCTACATGCGCCCACTAGCCGTGGAGGGGCATCCGTGGAGGAGAC 

GGACCAAAGCCACTTGGCCACTGCGGGCTCTACTTCATCGCATTCCTTGCAAA 

AGTATTACATCACGGGGGAGGCAGAGGGTTTCCCTGCCACAGTCTGA 

- HORMONE BINDING DOMAIN ENDS HERE - 
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